Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 287
Filtrar
1.
Nat Med ; 30(4): 958-968, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38641741

RESUMO

Causal machine learning (ML) offers flexible, data-driven methods for predicting treatment outcomes including efficacy and toxicity, thereby supporting the assessment and safety of drugs. A key benefit of causal ML is that it allows for estimating individualized treatment effects, so that clinical decision-making can be personalized to individual patient profiles. Causal ML can be used in combination with both clinical trial data and real-world data, such as clinical registries and electronic health records, but caution is needed to avoid biased or incorrect predictions. In this Perspective, we discuss the benefits of causal ML (relative to traditional statistical or ML approaches) and outline the key components and steps. Finally, we provide recommendations for the reliable use of causal ML and effective translation into the clinic.


Assuntos
Tomada de Decisão Clínica , Aprendizado de Máquina , Humanos , Causalidade , Resultado do Tratamento , Registros Eletrônicos de Saúde
2.
NPJ Parkinsons Dis ; 10(1): 58, 2024 Mar 13.
Artigo em Inglês | MEDLINE | ID: mdl-38480700

RESUMO

Characterization of Parkinson's disease (PD) progression using real-world evidence could guide clinical trial design and identify subpopulations. Efforts to curate research populations, the increasing availability of real-world data, and advances in natural language processing, particularly large language models, allow for a more granular comparison of populations than previously possible. This study includes two research populations and two real-world data-derived (RWD) populations. The research populations are the Harvard Biomarkers Study (HBS, N = 935), a longitudinal biomarkers cohort study with in-person structured study visits; and Fox Insights (N = 36,660), an online self-survey-based research study of the Michael J. Fox Foundation. Real-world cohorts are the Optum Integrated Claims-electronic health records (N = 157,475), representing wide-scale linked medical and claims data and de-identified data from Mass General Brigham (MGB, N = 22,949), an academic hospital system. Structured, de-identified electronic health records data at MGB are supplemented using a manually validated natural language processing with a large language model to extract measurements of PD progression. Motor and cognitive progression scores change more rapidly in MGB than HBS (median survival until H&Y 3: 5.6 years vs. >10, p < 0.001; mini-mental state exam median decline 0.28 vs. 0.11, p < 0.001; and clinically recognized cognitive decline, p = 0.001). In real-world populations, patients are diagnosed more than eleven years later (RWD mean of 72.2 vs. research mean of 60.4, p < 0.001). After diagnosis, in real-world cohorts, treatment with PD medications has initiated an average of 2.3 years later (95% CI: [2.1-2.4]; p < 0.001). This study provides a detailed characterization of Parkinson's progression in diverse populations. It delineates systemic divergences in the patient populations enrolled in research settings vs. patients in the real-world. These divergences are likely due to a combination of selection bias and real population differences, but exact attribution of the causes is challenging. This study emphasizes a need to utilize multiple data sources and to diligently consider potential biases when planning, choosing data sources, and performing downstream tasks and analyses.

3.
medRxiv ; 2024 Feb 18.
Artigo em Inglês | MEDLINE | ID: mdl-38405736

RESUMO

Characterization of Parkinson's disease (PD) progression using real-world evidence could guide clinical trial design and identify subpopulations. Efforts to curate research populations, the increasing availability of real-world data and recent advances in natural language processing, particularly large language models, allow for a more granular comparison of populations and the methods of data collection describing these populations than previously possible. This study includes two research populations and two real-world data derived (RWD) populations. The research populations are the Harvard Biomarkers Study (HBS, N = 935), a longitudinal biomarkers cohort study with in-person structured study visits; and Fox Insights (N = 36,660), an online self-survey-based research study of the Michael J. Fox Foundation. Real-world cohorts are the Optum Integrated Claims-electronic health records (N = 157,475), representing wide-scale linked medical and claims data and de-identified data from Mass General Brigham (MGB, N = 22,949), an academic hospital system. Structured, de-identified electronic health records data at MGB are supplemented using natural language processing with a large language model to extract measurements of PD progression. This extraction process is manually validated for accuracy. Motor and cognitive progression scores change more rapidly in MGB than HBS (median survival until H&Y 3: 5.6 years vs. >10, p<0.001; mini-mental state exam median decline 0.28 vs. 0.11, p<0.001; and clinically recognized cognitive decline, p=0.001). In the real-world populations, patients are diagnosed more than eleven years later (RWD mean of 72.2 vs. research mean of 60.4, p<0.001). After diagnosis, in real-world cohorts, treatment with PD medications is initiated 2.3 years later on average (95% CI: [2.1-2.4]; p<0.001). This study provides a detailed characterization of Parkinson's progression in diverse populations. It delineates systemic divergences in the patient populations enrolled in research settings vs. patients in the real world. These divergences are likely due to a combination of selection bias and real population differences, but exact attribution of the causes is challenging using existing data. This study emphasizes a need to utilize multiple data sources and to diligently consider potential biases when planning, choosing data sources, and performing downstream tasks and analyses.

5.
bioRxiv ; 2024 Feb 16.
Artigo em Inglês | MEDLINE | ID: mdl-38405764

RESUMO

Genomics for rare disease diagnosis has advanced at a rapid pace due to our ability to perform "N-of-1" analyses on individual patients. The increasing sizes of ultra-rare, "N-of-1" disease cohorts internationally newly enables cohort-wide analyses for new discoveries, but well-calibrated statistical genetics approaches for jointly analyzing these patients are still under development.1,2 The Undiagnosed Diseases Network (UDN) brings multiple clinical, research and experimental centers under the same umbrella across the United States to facilitate and scale N-of-1 analyses. Here, we present the first joint analysis of whole genome sequencing data of UDN patients across the network. We apply existing and introduce new, well-calibrated statistical methods for prioritizing disease genes with de novo recurrence and compound heterozygosity. We also detect pathways enriched with candidate and known diagnostic genes. Our computational analysis, coupled with a systematic clinical review, recapitulated known diagnoses and revealed new disease associations. We make our gene-level findings and variant-level information across the cohort available in a public-facing browser (https://dbmi-bgm.github.io/udn-browser/). These results show that N-of-1 efforts should be supplemented by a joint genomic analysis across cohorts.

7.
Nat Commun ; 14(1): 6403, 2023 10 12.
Artigo em Inglês | MEDLINE | ID: mdl-37828001

RESUMO

Rare Mendelian disorders pose a major diagnostic challenge and collectively affect 300-400 million patients worldwide. Many automated tools aim to uncover causal genes in patients with suspected genetic disorders, but evaluation of these tools is limited due to the lack of comprehensive benchmark datasets that include previously unpublished conditions. Here, we present a computational pipeline that simulates realistic clinical datasets to address this deficit. Our framework jointly simulates complex phenotypes and challenging candidate genes and produces patients with novel genetic conditions. We demonstrate the similarity of our simulated patients to real patients from the Undiagnosed Diseases Network and evaluate common gene prioritization methods on the simulated cohort. These prioritization methods recover known gene-disease associations but perform poorly on diagnosing patients with novel genetic disorders. Our publicly-available dataset and codebase can be utilized by medical genetics researchers to evaluate, compare, and improve tools that aid in the diagnostic process.


Assuntos
Pacientes , Doenças Raras , Humanos , Simulação por Computador , Fenótipo , Doenças Raras/diagnóstico , Doenças Raras/genética
8.
Artigo em Inglês | MEDLINE | ID: mdl-37350904

RESUMO

Diabetes is associated with heterogeneous behaviors affecting patients' clinical characteristics and trajectories. This study includes 21,288 patients with type 2 diabetes (women, ages 30 to 65). The cohort was filtered through a set of preprocessing heuristics in order to assure the cohort exhibited a similar clinical trajectory. Anomalous characteristics were then identified using dimensionality reduction and anomaly detection methods. Compared to the majority of the cohort, patients classified as anomalous were twice as likely to be admitted into the hospital (7.94[7.59 8.28] versus 3.12[3.06 3.17] times), have a higher incidence of comorbidities (2[1.64 2.36] times more), and be prescribed more insulin and less new and more expensive diabetes medications (such as Sodium glucose co-transporter 2 inhibitors). Patients with these anomalous characteristics may benefit from additional or specialized interventions to avert their risk for adverse outcomes.

9.
Psychiatry Res ; 323: 115175, 2023 05.
Artigo em Inglês | MEDLINE | ID: mdl-37003169

RESUMO

Growing evidence has shown that applying machine learning models to large clinical data sources may exceed clinician performance in suicide risk stratification. However, many existing prediction models either suffer from "temporal bias" (a bias that stems from using case-control sampling) or require training on all available patient visit data. Here, we adopt a "landmark model" framework that aligns with clinical practice for prediction of suicide-related behaviors (SRBs) using a large electronic health record database. Using the landmark approach, we developed models for SRB prediction (regularized Cox regression and random survival forest) that establish a time-point (e.g., clinical visit) from which predictions are made over user-specified prediction windows using historical information up to that point. We applied this approach to cohorts from three clinical settings: general outpatient, psychiatric emergency department, and psychiatric inpatients, for varying prediction windows and lengths of historical data. Models achieved high discriminative performance (area under the Receiver Operating Characteristic curve 0.74-0.93 for the Cox model) across different prediction windows and settings, even with relatively short periods of historical data. In short, we developed accurate, dynamic SRB risk prediction models with the landmark approach that reduce bias and enhance the reliability and portability of suicide risk prediction models.


Assuntos
Serviço Hospitalar de Emergência , Tentativa de Suicídio , Humanos , Tentativa de Suicídio/psicologia , Reprodutibilidade dos Testes , Curva ROC
12.
PLoS One ; 18(1): e0266985, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36598895

RESUMO

PURPOSE: In young adults (18 to 49 years old), investigation of the acute respiratory distress syndrome (ARDS) after severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection has been limited. We evaluated the risk factors and outcomes of ARDS following infection with SARS-CoV-2 in a young adult population. METHODS: A retrospective cohort study was conducted between January 1st, 2020 and February 28th, 2021 using patient-level electronic health records (EHR), across 241 United States hospitals and 43 European hospitals participating in the Consortium for Clinical Characterization of COVID-19 by EHR (4CE). To identify the risk factors associated with ARDS, we compared young patients with and without ARDS through a federated analysis. We further compared the outcomes between young and old patients with ARDS. RESULTS: Among the 75,377 hospitalized patients with positive SARS-CoV-2 PCR, 1001 young adults presented with ARDS (7.8% of young hospitalized adults). Their mortality rate at 90 days was 16.2% and they presented with a similar complication rate for infection than older adults with ARDS. Peptic ulcer disease, paralysis, obesity, congestive heart failure, valvular disease, diabetes, chronic pulmonary disease and liver disease were associated with a higher risk of ARDS. We described a high prevalence of obesity (53%), hypertension (38%- although not significantly associated with ARDS), and diabetes (32%). CONCLUSION: Trough an innovative method, a large international cohort study of young adults developing ARDS after SARS-CoV-2 infection has been gather. It demonstrated the poor outcomes of this population and associated risk factor.


Assuntos
COVID-19 , Síndrome do Desconforto Respiratório , Humanos , Adulto Jovem , Idoso , Adolescente , Adulto , Pessoa de Meia-Idade , COVID-19/complicações , COVID-19/epidemiologia , SARS-CoV-2 , Estudos de Coortes , Estudos Retrospectivos , Registros Eletrônicos de Saúde , Síndrome do Desconforto Respiratório/etiologia , Síndrome do Desconforto Respiratório/complicações , Obesidade/complicações
13.
J Am Soc Nephrol ; 34(2): 309-321, 2023 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-36368777

RESUMO

BACKGROUND: The National Kidney Foundation and American Society of Nephrology Task Force on Reassessing the Inclusion of Race in Diagnosing Kidney Disease recently recommended a new race-free creatinine-based equation for eGFR. The effect on recommended clinical care across race and ethnicity groups is unknown. METHODS: We analyzed nationally representative cross-sectional questionnaires and medical examinations from 44,360 participants collected between 2001 and 2018 by the National Health and Nutrition Examination Survey. We quantified the number and proportion of Black, White, Hispanic, and Asian/Other adults with guideline-recommended changes in care. RESULTS: The new equation, if applied nationally, could assign new CKD diagnoses to 434,000 (95% confidence interval [CI], 350,000 to 517,000) Black adults, reclassify 584,000 (95% CI, 508,000 to 667,000) to more advanced stages of CKD, restrict kidney donation eligibility for 246,000 (95% CI, 189,000 to 303,000), expand nephrologist referrals for 41,800 (95% CI, 19,800 to 63,800), and reduce medication dosing for 222,000 (95% CI, 169,000 to 275,000). Among non-Black adults, these changes may undo CKD diagnoses for 5.51 million (95% CI, 4.86 million to 6.16 million), reclassify 4.59 million (95% CI, 4.28 million to 4.92 million) to less advanced stages of CKD, expand kidney donation eligibility for 3.96 million (95% CI, 3.46 million to 4.46 million), reverse nephrologist referral for 75,800 (95% CI, 35,400 to 116,000), and reverse medication dose reductions for 1.47 million (95% CI, 1.22 million to 1.73 million). The racial and ethnic mix of the populations used to develop eGFR equations has a substantial effect on potential care changes. CONCLUSION: The newly recommended 2021 CKD-EPI creatinine-based eGFR equation may result in substantial changes to recommended care for US patients of all racial and ethnic groups.


Assuntos
Insuficiência Renal Crônica , Adulto , Humanos , Creatinina , Taxa de Filtração Glomerular , Inquéritos Nutricionais , Estudos Transversais , Insuficiência Renal Crônica/diagnóstico
14.
EClinicalMedicine ; 55: 101724, 2023 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-36381999

RESUMO

Background: While acute kidney injury (AKI) is a common complication in COVID-19, data on post-AKI kidney function recovery and the clinical factors associated with poor kidney function recovery is lacking. Methods: A retrospective multi-centre observational cohort study comprising 12,891 hospitalized patients aged 18 years or older with a diagnosis of SARS-CoV-2 infection confirmed by polymerase chain reaction from 1 January 2020 to 10 September 2020, and with at least one serum creatinine value 1-365 days prior to admission. Mortality and serum creatinine values were obtained up to 10 September 2021. Findings: Advanced age (HR 2.77, 95%CI 2.53-3.04, p < 0.0001), severe COVID-19 (HR 2.91, 95%CI 2.03-4.17, p < 0.0001), severe AKI (KDIGO stage 3: HR 4.22, 95%CI 3.55-5.00, p < 0.0001), and ischemic heart disease (HR 1.26, 95%CI 1.14-1.39, p < 0.0001) were associated with worse mortality outcomes. AKI severity (KDIGO stage 3: HR 0.41, 95%CI 0.37-0.46, p < 0.0001) was associated with worse kidney function recovery, whereas remdesivir use (HR 1.34, 95%CI 1.17-1.54, p < 0.0001) was associated with better kidney function recovery. In a subset of patients without chronic kidney disease, advanced age (HR 1.38, 95%CI 1.20-1.58, p < 0.0001), male sex (HR 1.67, 95%CI 1.45-1.93, p < 0.0001), severe AKI (KDIGO stage 3: HR 11.68, 95%CI 9.80-13.91, p < 0.0001), and hypertension (HR 1.22, 95%CI 1.10-1.36, p = 0.0002) were associated with post-AKI kidney function impairment. Furthermore, patients with COVID-19-associated AKI had significant and persistent elevations of baseline serum creatinine 125% or more at 180 days (RR 1.49, 95%CI 1.32-1.67) and 365 days (RR 1.54, 95%CI 1.21-1.96) compared to COVID-19 patients with no AKI. Interpretation: COVID-19-associated AKI was associated with higher mortality, and severe COVID-19-associated AKI was associated with worse long-term post-AKI kidney function recovery. Funding: Authors are supported by various funders, with full details stated in the acknowledgement section.

15.
Inflamm Bowel Dis ; 29(5): 695-704, 2023 05 02.
Artigo em Inglês | MEDLINE | ID: mdl-35786768

RESUMO

BACKGROUND: With an increasing number of therapeutic options available for the management of ulcerative colitis (UC), the variability in treatment and prescribing patterns is not well known. While recent guidelines have provided updates on how these therapeutic options should be used, patterns of long-term use of these drugs over the past 2 decades remain unclear. METHODS: We analyzed a retrospective, nationwide cohort of more than 1.7 million prescriptions for trends in prescribing behaviors and to evaluate practices suggested in guidelines relating to ordering biologics, step-up therapy, and combination therapy. The primary outcome was 30-day steroid-free remission and secondary outcomes included hospitalization, cost, and additional steroid usage. A pipeline was created to identify cohorts of patients under active UC medical management grouped by prescribing strategies to evaluate comparative outcomes between strategies. Cox proportional hazards and multivariate regression models were utilized to assess postexposure outcomes and adjust for confounders. RESULTS: Among 6 major drug categories, we noted major baseline differences in patient characteristics at first exposure corresponding to disease activity. We noted earlier use of biologics in patient trajectories (762 days earlier relative to UC diagnosis, 2018 vs 2008; P < .001) and greater overall use of biologics over time (2.53× more in 2018 vs 2008; P < .00001) . Among biologic-naive patients, adalimumab was associated with slightly lower rates of remission compared with infliximab or vedolizumab (odds ratio, 0.92; P < .005). Comparisons of patients with early biologic initiation to patients who transitioned to biologics from 5-aminosalicylic acid suggest lower steroid consumption for early biologic initiation (-761 mg prednisone; P < .001). Combination thiopurine-biologic therapy was associated with higher odds of remission compared with biologic monotherapy (odds ratio, 1.36; P = .01). CONCLUSIONS: As biologic drugs have become increasingly available for UC management, they have increasingly been used at earlier stages of disease management. Large-scale analyses of prescribing behaviors provide evidence supporting early use of biologics compared with step-up therapy and use of thiopurine and biologic combination therapy.


Population-scale analysis reveals patterns in prescribing trends for ulcerative colitis management. Findings include (1) earlier use of biologics in patient trajectories, (2) associations of step-up therapy with higher corticosteroid exposure, and (3) association of combination therapy with positive patient outcomes.


Assuntos
Produtos Biológicos , Colite Ulcerativa , Humanos , Colite Ulcerativa/tratamento farmacológico , Estudos Retrospectivos , Infliximab/uso terapêutico , Adalimumab/uso terapêutico , Fatores Biológicos/uso terapêutico , Fatores Imunológicos/uso terapêutico , Produtos Biológicos/uso terapêutico
16.
J Am Med Inform Assoc ; 29(11): 1870-1878, 2022 10 07.
Artigo em Inglês | MEDLINE | ID: mdl-35932187

RESUMO

OBJECTIVE: This study aimed is to: (1) extend the Integrating the Biology and the Bedside (i2b2) data and application models to include medical imaging appropriate use criteria, enabling it to serve as a platform to monitor local impact of the Protecting Access to Medicare Act's (PAMA) imaging clinical decision support (CDS) requirements, and (2) validate the i2b2 extension using data from the Medicare Imaging Demonstration (MID) CDS implementation. MATERIALS AND METHODS: This study provided a reference implementation and assessed its validity and reliability using data from the MID, the federal government's predecessor to PAMA's imaging CDS program. The Star Schema was extended to describe the interactions of imaging ordering providers with the CDS. New ontologies were added to enable mapping medical imaging appropriateness data to i2b2 schema. z-Ratio for testing the significance of the difference between 2 independent proportions was utilized. RESULTS: The reference implementation used 26 327 orders for imaging examinations which were persisted to the modified i2b2 schema. As an illustration of the analytical capabilities of the Web Client, we report that 331/1192 or 28.1% of imaging orders were deemed appropriate by the CDS system at the end of the intervention period (September 2013), an increase from 162/1223 or 13.2% for the first month of the baseline period, December 2011 (P = .0212), consistent with previous studies. CONCLUSIONS: The i2b2 platform can be extended to monitor local impact of PAMA's appropriateness of imaging ordering CDS requirements.


Assuntos
Sistemas de Apoio a Decisões Clínicas , Idoso , Diagnóstico por Imagem , Humanos , Medicare , Monitorização Fisiológica , Reprodutibilidade dos Testes , Estados Unidos
18.
Transl Psychiatry ; 12(1): 154, 2022 04 11.
Artigo em Inglês | MEDLINE | ID: mdl-35410453

RESUMO

Schizophrenia affects >3.2 million people in the USA. However, its comorbidity patterns have not been systematically characterized in real-world populations. To address this gap, we conducted an observational study using a cohort of 86 million patients in a nationwide health insurance dataset. We identified participants with schizophrenia and those without schizophrenia matched by age, sex, and the first three digits of zip code. For each phenotype encoded in phecodes, we compared their prevalence in schizophrenia patients and the matched non-schizophrenic participants, and we performed subgroup analyses stratified by age and sex. Results show that anxiety, posttraumatic stress disorder, and substance abuse commonly occur in adolescents and young adults prior to schizophrenia diagnoses. Patients aged 60 and above are at higher risks of developing delirium, alcoholism, dementia, pelvic fracture, and osteomyelitis than their matched controls. Type 2 diabetes, sleep apnea, and eating disorders were more prevalent in women prior to schizophrenia diagnosis, whereas acute renal failure, rhabdomyolysis, and developmental delays were found at higher rates in men. Anxiety and obesity are more commonly seen in patients with schizoaffective disorders compared to patients with other types of schizophrenia. Leveraging a large-scale insurance claims dataset, this study identified less-known comorbidity patterns of schizophrenia and confirmed known ones. These comorbidity profiles can guide clinicians and researchers to take heed of early signs of co-occurring diseases.


Assuntos
Diabetes Mellitus Tipo 2 , Transtornos Psicóticos , Esquizofrenia , Adolescente , Comorbidade , Análise de Dados , Diabetes Mellitus Tipo 2/epidemiologia , Feminino , Humanos , Transtornos Psicóticos/psicologia , Esquizofrenia/diagnóstico , Esquizofrenia/epidemiologia
19.
Lancet ; 398(10316): 2093-2100, 2021 12 04.
Artigo em Inglês | MEDLINE | ID: mdl-34756184

RESUMO

BACKGROUND: Many countries are experiencing a resurgence of COVID-19, driven predominantly by the delta (B.1.617.2) variant of SARS-CoV-2. In response, these countries are considering the administration of a third dose of mRNA COVID-19 vaccine as a booster dose to address potential waning immunity over time and reduced effectiveness against the delta variant. We aimed to use the data repositories of Israel's largest health-care organisation to evaluate the effectiveness of a third dose of the BNT162b2 mRNA vaccine for preventing severe COVID-19 outcomes. METHODS: Using data from Clalit Health Services, which provides mandatory health-care coverage for over half of the Israeli population, individuals receiving a third vaccine dose between July 30, 2020, and Sept 23, 2021, were matched (1:1) to demographically and clinically similar controls who did not receive a third dose. Eligible participants had received the second vaccine dose at least 5 months before the recruitment date, had no previous documented SARS-CoV-2 infection, and had no contact with the health-care system in the 3 days before recruitment. Individuals who are health-care workers, live in long-term care facilities, or are medically confined to their homes were excluded. Primary outcomes were COVID-19-related admission to hospital, severe disease, and COVID-19-related death. The third dose effectiveness for each outcome was estimated as 1 - risk ratio using the Kaplan-Meier estimator. FINDINGS: 1 158 269 individuals were eligible to be included in the third dose group. Following matching, the third dose and control groups each included 728 321 individuals. Participants had a median age of 52 years (IQR 37-68) and 51% were female. The median follow-up time was 13 days (IQR 6-21) in both groups. Vaccine effectiveness evaluated at least 7 days after receipt of the third dose, compared with receiving only two doses at least 5 months ago, was estimated to be 93% (231 events for two doses vs 29 events for three doses; 95% CI 88-97) for admission to hospital, 92% (157 vs 17 events; 82-97) for severe disease, and 81% (44 vs seven events; 59-97) for COVID-19-related death. INTERPRETATION: Our findings suggest that a third dose of the BNT162b2 mRNA vaccine is effective in protecting individuals against severe COVID-19-related outcomes, compared with receiving only two doses at least 5 months ago. FUNDING: The Ivan and Francesca Berkowitz Family Living Laboratory Collaboration at Harvard Medical School and Clalit Research Institute.


Assuntos
Vacina BNT162 , COVID-19/prevenção & controle , Imunização Secundária , Eficácia de Vacinas , Adulto , Idoso , COVID-19/epidemiologia , COVID-19/virologia , Feminino , Humanos , Israel/epidemiologia , Masculino , Vacinação em Massa , Pessoa de Meia-Idade , Pandemias/prevenção & controle , Prognóstico , SARS-CoV-2
20.
NPJ Breast Cancer ; 7(1): 147, 2021 Nov 29.
Artigo em Inglês | MEDLINE | ID: mdl-34845230

RESUMO

Histopathologic evaluation of biopsy slides is a critical step in diagnosing and subtyping breast cancers. However, the connections between histology and multi-omics status have never been systematically explored or interpreted. We developed weakly supervised deep learning models over hematoxylin-and-eosin-stained slides to examine the relations between visual morphological signal, clinical subtyping, gene expression, and mutation status in breast cancer. We first designed fully automated models for tumor detection and pathology subtype classification, with the results validated in independent cohorts (area under the receiver operating characteristic curve ≥ 0.950). Using only visual information, our models achieved strong predictive performance in estrogen/progesterone/HER2 receptor status, PAM50 status, and TP53 mutation status. We demonstrated that these models learned lymphocyte-specific morphological signals to identify estrogen receptor status. Examination of the PAM50 cohort revealed a subset of PAM50 genes whose expression reflects cancer morphology. This work demonstrates the utility of deep learning-based image models in both clinical and research regimes, through its ability to uncover connections between visual morphology and genetic statuses.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA